PLP coefficients can be quantized at 400 bps
نویسندگان
چکیده
Previous work in wireless speech recognition has focused on two methods, namely, quantizing recognition features (e.g. MFCC) or performing recognition using speech coding parameters (e.g. LPC). All of this previous research assumes that the communication channel is only large enough to transmit either speech coding parameters or speech recognition parameters. By contrast, we propose that the speech recognition parameters can be quantized at a rate sufficiently low to allow transmission of both speech coding and speech recognition parameters over a standard cellular channel. In particular, this paper shows that the perceptual LPC (PLP) coefficients can be transmitted at 400 bps with an insignificant loss of digit recognition accuracy.
منابع مشابه
Channel Noise Robustness for Low-bitrat
In remote (or distributed) speech recognition , the recognition features are quantized at the client, and transmitted to the server via wireless or packet-based communication for recognition. In this paper, we investigate the issue of robustness of remote speech recognition applications against channel noise. The techniques presented include: 1) optimal soft decision channel decoding allowing f...
متن کاملEfficient quantization of LSF parameters based on temporal decomposition
In this paper, we present a restricted temporal decomposition method for LSF parameters. The event vectors estimated by this method preserve the ordering property of LSF parameters so that they can be quantized efficiently. Experimental results show that interpolated LSF parameters can be quantized transparently at the rate of 753 bps. We also design a LPC vocoder at 996 bps as an application o...
متن کاملA New Fast and Efficient HMM-Based Face Recognition System Using a 7-State HMM Along With SVD Coefficients
In this paper, a new Hidden Markov Model (HMM)-based face recognition system is proposed. As a novel point despite of five-state HMM used in pervious researches, we used 7-state HMM to cover more details. Indeed we add two new face regions, eyebrows and chin, to the model. As another novel point, we used a small number of quantized Singular Values Decomposition (SVD) coefficients as feature...
متن کاملSource and channel coding for remote speech recognition over error-prone channels
This paper presents source and channel coding techniques for remote automatic speech recognition (ASR) systems. As a case study, Line Spectral Pairs (LSP) extracted from the 6th order allpole Perceptual Linear Prediction (PLP) spectrum are transmitted and speech recognition features are then obtained. The LSPs, quantized using first-order predictive vector quantization (VQ) at 300 bps, provide ...
متن کاملExhaustive Generation and Visual Browsing for Radiation Patterns of Linear Array Antennas
Almost any obtainable radiation pattern can be achieved with a phased array antenna if the phases and amplitudes are chosen correctly. However, if these are quantized, it can be a time consuming and difficult process for a human expert to determine the best Quantized excitation coefficients to produce a desired radiation pattern. In this paper, we explore the use of exhaustive generation of all...
متن کامل